Corpus: ces_news_2013_1M

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1...5) at sentence endings, counted over the first K sentences

      K  # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
    100          98            98             98            98            98
   1000         881           963            975           984           987
  10000        7455          9565           9856          9921          9936
 100000       41071         84049          95693         98569         99166
1000000      172790        633833         866654        949999        974527
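
The counts above could be reproduced roughly as in the following Python sketch. It is not the tool that generated this page: the function name `ending_ngram_counts`, the whitespace tokenization, and the choice to skip sentences shorter than N tokens are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List


def ending_ngram_counts(
    sentences: Iterable[str], ks: Iterable[int], max_n: int = 5
) -> Dict[int, List[int]]:
    """Count distinct sentence-ending word N-grams for the first K sentences.

    Assumptions (not taken from the source page): tokens are split on
    whitespace, and a sentence with fewer than N tokens contributes no
    N-gram for that N.
    """
    # One set of already-seen ending N-grams per N (index 0 holds N=1).
    seen = [set() for _ in range(max_n)]
    checkpoints = set(ks)
    table: Dict[int, List[int]] = {}

    for i, sentence in enumerate(sentences, start=1):
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                # The ending N-gram is the last n tokens of the sentence.
                seen[n - 1].add(tuple(tokens[-n:]))
        if i in checkpoints:
            # Record the number of distinct endings seen so far for each N.
            table[i] = [len(s) for s in seen]
    return table


# Tiny made-up example, not data from the corpus.
demo = ["the cat sat .", "a dog sat .", "the cat sat ."]
result = ending_ngram_counts(demo, ks=[1, 3], max_n=3)
print(result)
```

Each row of the returned table corresponds to one value of K, and each column to one N. As in the table above, the count for any N can never exceed K, because each sentence contributes at most one ending N-gram.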


Zipf's diagram for sentence endings


[Figure: Gnuplot diagram]

145591 msec needed at 2017-11-05 08:42